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Abstract 

We apply nonperturbative variational techniques to a relativistic scalar field theory in 
which heavy bosons ("nucleons") interact with light scalar mesons via a Yukawa cou- 
pling. Integrating out the meson field and neglecting the nucleon vacuum polarization 
one obtains an effective action in terms of the heavy particle coordinates which is nonlocal 
in the proper time. As in Feynman's polaron approach we approximate this action by 
a retarded quadratic action whose parameters are to be determined variationally on the 
pole of the two-point function. Several ansatze for the retardation function are studied 
and for the most general case we derive a system of coupled variational equations. An 
approximate analytic solution displays the instability of the system for coupling constants 
beyond a critical value. 



PACS numbers : 11.80.Fv, 11.15.Tk, 11. 10. St 



1 Introduction 



Variational methods have a long history and are still widely used in physics to obtain approx- 
imate non-perturbative solutions. For a very wide class of problems specified by a given set 
of equations it is indeed always possible to construct a variational principle which will give 
an estimate of the quantity of interest correct to first order if the quantities appearing in the 
variational principle are known to zeroth order M. In quantum mechanics the best-known 
variational principle is the Rayleigh-Ritz variational principle for the energy which is applied 
extensively in molecular, atomic and nuclear physics. 

In contrast, the applications of variational principles in quantum field theory are rather 
limited (for a review see Ref. |2|). Within the Hamiltonian formalism several studies exist ( see, 
e.g.,]3], f|] ). The best known covariant example is also a Rayleigh-Ritz variational principle 
which has been formulated in the functional Schrodinger representation H]. It leads to the 
Hartree (-Fock) approximation when a gaussian wave functional is used. Unfortunately the 
latter is the only trial functional which can be used for practical purposes, which drastically 
restricts the power of the variational principle. In addition, in quantum field theory it is 
not the energy of the ground state (vacuum) one is interested in but the energy (mass) of 
excitations. Already in ordinary quantum mechanics this is much harder to obtain. The 
need for renormalization and the infinitely many degrees of freedom add to the "difficulties in 
applying the variational principle to quantum field theory" so that Feynman expressed a rather 
pessimistic view on a workshop devoted to that topic ||. 

It is remarkable that the variational principle works very well in a nonrelativistic field- 
theoretical problem, the polaron (for reviews see |7|, [|, || |10|), but only after the infinitely 
many degrees of freedom for the phonons are integrated out exactly. This gives rise to a 
non-local effective action which Feynman approximated variationally by a retarded quadratic 



action [pT| . Recent exact Monte-Carlo calculations [|12] have again demonstrated that the 
Feynman polaron approximation is the best analytical approximation which works for small as 
well as large coupling constants. Taking the known strong-coupling expansions as a yardstick 
the ground-state energy deviates less than 2.2% and the effective mass (which determines the 
lowest excitations) less than 12% from the exact values. This success can be attributed to the 
reduction in the number of variables and the explicit allowance of retardation in the quadratic 
trial action. Feynman used a specific parametrization for the retardation function but the most 
general form gives only a very small improvement in the ground state energy [T3, TA\. 

Although the Feynman variational principle (or Jensen's inequality in mathematical lan- 
guage) has sometimes been used in field theory (see e.g. fL5|), it was never used in the context 
which made it so successful in the polaron problem: namely, approximating a nonlocal action 
expressed in terms of particle coordinates by a retarded quadratic one. We will do so in the 
present work which is the first in a planned series about variational approximations employ- 
ing the particle representation of field theory. The concept of using particle trajectories as 
dynamical variables in a relativistic quantum theory is an old one: it dates back to the 1937 
paper by Fock [16] who investigated the role of proper time in relativistic equations. In the 
early 50's Nambu |l7j], Feynman and Schwinger [19| made much use of it, but canonical 



"second") quantization later took over and dominated, in particular in the text books (an 



exception is, of course, Ref. p| ). Only a few works [2TL 22, 031 have employed this approach 



in the following years. The renewed interest in the particle representation (see also |23, E5[) is 
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Figure 1: Contour plot of the classical "potential" of Eq. (Q). 



due to superstring-inspired techniques for efficient calculation of one-loop diagrams which have 
been shown to be connected to the ( "first" )-quantized form of field theory |26 . 



For the moment we want to restrict our discussion to scalar field theories. This avoids the 
complications of spin in a path integral, for which there is extensive discussion in the literature 
(see, for example, |?7], |28|). Also, having in mind applications in few-body physics, we take the 
simplest field theory where a light scalar particle (the "pion" ) has a Yukawa coupling to a heavy 
scalar (the "nucleon"). This is the Wick-Cutkosky model which usually is considered 
as a simple model for relativistic bound-state problems treated in the ladder approximation to 
the Bethe-Salpeter equation (see e.g. Ref. f3~lfl , Chapter 10-2). Recently it also has become a 
popular playing field for light-cone techniques 131: 



To be specific we consider the following Lagrangian in euclidean space time 

£ = \ (d^) 2 + + \ (d^) 2 + imV " 9&<P (1) 

where Mo is the bare mass of the heavy particle (which we shall call, for brevity, the "nucleon" ), 
m is the mass of the light particle (the "meson") and g is the (dimensionfull) coupling constant 
of the Yukawa interaction between the two particles. It is well known [[35| that such a coupling 
is equivalent to a $ 3 theory and therefore the ground state of the theory is unstable. This is 
best seen in Fig. [I] which shows a contour plot of the classical "potential" 

W^ 2 -^+WL-^y . ( 2) 



2 u 2m 2 2 V m 2 

The superscript zero reminds us that this is the potential in zeroth order in an expansion in 
powers of h. One-loop quantum corrections modify the behaviour shown in Fig. |l| somewhat, 
but no qualitative change occurs. 
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Figure 2: Cut of the classical potential along the line (||). 

Clearly, the minimum at <£> = <p = is only a local minimum. For positive (p and nonzero <3? 
the "potential" decreases indefinitely. Therefore the 'ground state' sitting near <3> = (p = is 
only metastable, at least in a classical description. From semiclassical descriptions of tunneling 
p6| , |37j we expect the lifetime to depend on the minimum height and thickness of the barrier 
for a given coupling constant. 

Differentiating Eq. (§) with respect to <p we obtain 

^min = (3) 



m? 



and the "potential" along this path 



V^^^ mm ) = hl^ -£- 2 & (4) 

is an inverted double well as shown in Fig. 

We are, however, not genuinely interested in this instability of the 'ground state' in the 
Wick-Cutkosky model. Rather, we want to use it as a field theoretical toy model for the 
dressing of physical nucleons by mesons. The arguments showing the instability of the 'ground 



state' for a scalar "nucleon" do not apply for the case where the nucleons have spin |3q| . In 
other words, the instability is an unwanted side effect of the simplified model considered here 
and we shall ignore it whenever possible. Operationally, we can do this as long as we restrict 
the parameters of the model such that the width of the ground state is small compared to its 
mass. From the above arguments it is clear that this corresponds to sufficiently small couplings. 
Indeed it will turn out, quite reasonably, that the variational equations we shall derive cease 
to have real solutions once the coupling becomes too large; i.e. the formalism itself tells us in 
which region it remains applicable. 

We will study the dressing of a single "nucleon" in the quenched approximation, i.e. ne- 
glecting pair creation of heavy particles which should be a good approximation in low-energy 



4 



processes. In this approximation it is possible to integrate out the mesons exactly and to obtain 
an effective non-local action which is a covariant functional of the particle four-coordinates with 
the proper time as parameter. This effective action bears a surprising similarity to the polaron 
action so that we could even call the dressed particle a "relativistic polaron" . We then perform 
a variational calculation with a quadratic trial action in complete analogy to the polaron case, 
except that we use a covariant description and have to renormalize the mass of the heavy par- 
ticle. Recently Simonov and Tjon ^ have also studied the Wick-Cutkosky model in the 
quenched approximation and in the particle representation. However, their aim was to solve 
the relativistic bound state problem beyond the ladder approximation and they neglected all 
self-energy and vertex corrections. Consequently there is no need for renormalization and no 
sign of the instability in their work. 

This paper is organized as follows: In Section |2| and |3| we respectively derive the effective 
action in the particle representation of the Wick-Cutkosky model and perform the variational 
approximation a la Feynman. The latter is done at the pole of the two-point function. In 
Section |] we discuss different variational ansatze for the retardation function and we set up the 
coupled system of equations which arises when no assumptions are made about the form of the 
retardation function. We study a simple approximate solution of these variational equations 
which displays the instability of the ground state. The main results of this work are summarized 
in the last Section whereas some technical details are relegated to the Appendix. 



2 Effective Action in the Particle Representation 

We begin with the generating functional for the Green functions of the theory, 
Z [J,j] = fv^Vip exp (-51$, <p] + (J, $) + (j, if) ) . 



Here 



denotes the action and we use 



S[$,<p] = J d 4 x C($(x),(p(x)) 
(J, $) = J d 4 x J(x)$(x) etc. 



(5) 
(6) 

(7) 



clS db convenient abbreviation for the source terms. 

Our aim will be to integrate out the mesonic degrees of freedom in order to get an effective 
action for the heavy particles. Indeed, as the meson field ip appears at most quadratically in 
the path integral one could do so immediately, using 



IV exp 



const 



(det D) 1 / 2 

Considering for simplicity the case j = 0, we'd obtain 



exp 



1 



(8) 



J Dp exp 



■-(^(-□ + m 2 )^) + (?(0 2 ,^) 



const 



(det £> ro )V2 



exp 



i m i 



(9) 
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where 



D r 



-□ + 7JT 



(10) 



is the inverse meson propagator. In Eq. (^ the prefactor arising from the gaussian integration is 
independent of the field <3> and the sources and can be absorbed in the (irrelevant) normalization 
factor for the path integral. Therefore the effective action for the heavy field would be given 
by 



-□ 



r$ 2 



(11) 



This is a nonlocal $ 4 -theory whose interaction term has the wrong sign, i.e. this action is not 
bounded from below. This leads to the vacuum instability discussed above for the classical 
limit. To solve the model completely one now would still have to perform a functional integral 
over the heavy field $. Due to the non-gaussian nature of the resulting path integral this is 
impossible to do analytically and one has to resort to approximative methods. 

Given that we want to apply a variational approach, it turns out (as we shall see later) that 
it is actually advantageous to first integrate out the heavy field before doing the same for the 
light field. Although this sounds paradoxical in view of the stated aim, we will reintroduce the 
heavy particle coordinate at a later stage. Applying Eq. (pi), we obtain 



J V<5> exp [--($, (-□ + Ml - 2^)$) + (J, $) 



const 



[det(-D + Ml - 2g<p) 



,1/2 



exp( 



with 



/ [$, J] 




1 



J 



-i A ) 

(12) 
(13) 



-□ + Ml - 2gip 

In contrast to Eq. (Q) the prefactor now explicitly depends on the meson field tp over which have 
to finally integrate. As the determinant is a highly nonlinear and nonlocal object this makes 
an analytical evaluation impossible. However, it is well known that the prefactor describes pair 
production which is greatly suppressed if the mass of these particles is large: 



det(-D + Ml - 2gip) 
const 



det(-D + Ml - 2gcp) 
det(-D + M?) 



det 1 - 2g 



-□ + Ml 



■cp 



1 . 



(14) 



In the following we will adopt this "quenched approximation" and concentrate on the two-point 
function for one nucleon with an arbitrary number of mesons. For this object we then have the 
following generating functional 



Z' [j, x] = 



S 2 Z [J J] 



5J(x) 5J(0) 
JVip <x\ — 



□ 



Ml 



2gtp 



y = > exp 



--(<p,D m <p) + (j,<p) 



(15) 



This obviously describes the propagation of a "nucleon" in the presence of an external field 
gtp{x) over which one has to integrate functionally with a given weighting function. To perform 
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this integration we use a trick due to Schwinger and exponentiate the nucleon propagator 



p 2 + Ml - 2gy{x) 



d(3 exp 



-B(f + M Q 2 -2g V {x)) 



(16) 



dfj,/i is the four-momentum operator. The integration variable (3 usually is called 

. Actually Eq. ([II]) only holds if the 



where p^ 

"fifth parameter" or "proper time" (Refs. f|16|, 17|, 18L |1 



corresponding operator is positive definite which, in general, is not the case since the meson 
field <f(x) can take any values when integrated over functionally. This means that the meson 
fluctuations can become so large that the nucleon locally becomes massless or even tachyonic. 
The correct way to exponentiate therefore would be 



1 



p 2 + Ml - 2g<p(x) - ie 



— I dT exp 
2 Jo P 



ie 



;i7) 



i.e. to introduce Minkowski proper time instead of the euclidean one as in Eq. (jig). We recall 
from the Introduction (see Eqs. (|3|, |j) ) that large meson fields can carry one over the barrier 
and induce the instability of the ground state. Since we want to disregard this instability as 
much as possible and since numerical calculations are much easier in euclidean proper time we 
will nevertheless use Eq. ([TBI) i n the following. However, we should expect a breakdown of this 
description for coupling constants large enough to induce fluctuations over the barrier. 

Even with the proper time representation ( |T6"|) for the nucleon propagator we cannot perform 
the (p integration since the operator p 2 does not commute with the external potential g<p(x). 
However, formally 



U(x, (3; 0, 0) =< x | exp 



y = 0> 



is the matrix element of the euclidean time evolution operator of a non - relativistic particle 
of unit mass [] in the potential g<p(x). Therefore we can express it as a path integral over the 
coordinate x(r) of the particle beginning at x(0) = and ending at x(/3) = x p0| , [40 



U{x,P;Q,0) = / Vxir) exp 

Jx{0)=0 



dr 



\x 2 ~ 9<p{x(t)) 



(19) 



As all quantities in the path integral ( |i~9"D are c-numbers the gaussian ^-integral 



J V(f exp 

can now be performed with the help of Eq 



-(<p,D m <p) + (h,(p) 



Z' [j,x] = const J d/3 exp ( — — M, 




Kv)= j(y) +9 / dTS{y-x{T) 
Jo 

The result is 

x(f3)=x 



(20) 



x(0)=0 



Vx(t) exp(-5' eff [x(t), j] ) 



(21) 



1 A different value should not change physical observables since it only corresponds to a different parametriza- 
tion of the particle path. It can be shown that such a 'reparametrization' invariance holds in our variational 
approximation. The present choice is called the 'proper-time gauge' [p7|| . 
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where the effective action is given by 

S eS [x(r),j] = j^dr h 2 - D m l h) . (22) 

It is convenient to write it in the form 

S cS [x(r),j] = S [x(r)} + S l [x(r)] + S 2 [x(r),j] + S 3 [j] (23) 



with 



S [x(t)] = f dr\x 2 (24) 
Jo 2 

.2 f p 



Si[x(t)] = ~ 9 - I dn I dr 2 <x{r 1 )\D m l \x{T 2 )> (25) 
2 Jo J o 

S 2 [x(t),j] = -gJd 4 yj(y)jyr<y\D^\x(r)> (26) 

^3 [j] = ~\ J d A yi d 4 y 2 j( Vl ) < Vl \ D" 1 \y 2 > j(y 2 ) . (27) 

Note that the last term S3 [j] in the action does not depend on the trajectory x{r) of the nucleon 
and therefore the external meson lines which are generated by differentiating with respect to 
the meson source j are not attached to the nucleon line. Thus the generating functional for 
connected Green functions G 2 ^ n simply is 

Z' conn [j,x] = Z'[j,x]\ s3=Q . (28) 

Compared to the usual procedure via a Legendre transform this simple identification is just 
one of many advantages of field theory in the "particle representation" . Another one is the big 
reduction in degrees of freedom: although in Eq. (|2l|) one still has to do a functional integration, 
it is over 4 functions of one variable (the proper time), whereas the previous field theoretical 
path integral @ is over one function of 4 variables (namely the space-time coordinates). It is 
for this reason that one might expect a variational approach based on particle coordinates to 
be superior to the one based on field variables, given that in both cases only quadratic trial 
actions can be used in practical calculations. 



man 



Eqs. (|24| , p5|) are the relativistic generalization of the retarded polaron action which Feyn- 
TTJ derived when integrating out the phonons from the polaron Hamiltonian. The meson 



propagator may be written as 



d 4 a e iq '^ v ' 1 

< x\ D" 1 \y >= / —\ ^—- 2 , (29) 
(27TJ 4 q z + m A 



and so Eq. (|25|) becomes 



Si[x(r)\ = fdr x fdr 2 f _L_ e <*«<n)-*foO) . (30 ) 
2 Jo Jo J (27r) 4 q l + m l 

Comparing with the polaron action |T^] 

Sf-Mr)] - -^jOjO /g^'^"' (3D 
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one observes a striking similarity. This is even more pronounced when we perform the go- 
integration in Eq. (|30|) which gives 

a 2 f 13 r 13 f d 3 a e -^i\ x o(ri)-xo{T 2 )\ 
S Mr)] = -^r dn / dr 2 \-\- eJq .(x(n)-x(. 2) ) 32 

l07T JO JO J 27T^ UJ q 

with uj q = (q 2 + m 2 ) 1 / 2 . However, there are also some differences which should be noted : 

(i) All coordinates and momenta in S\ in Eq. fl30D , as opposed to Sf olaIon , are four- dimensional 
and therefore Lorentz invariance is explicit. 

(ii) A massive meson propagator enters into the effective action of the Wick-Cutkosky model 
instead of the Coulomb propagator in the polaron problem. 



(iii) The explicitly Lorentz invariant expression for Si ( Eq. (pOf ) ) does not contain a retar- 
dation factor in the proper time, whereas the polaron effective action does because of the 
(normal) time it takes to exchange optical phonons of unit frequency. The 3-dimensional 
version of Si ( Eq. ( |3"i2"D ) does contain a retardation, however, it is not just proportional 
to the proper time difference. 

To maintain explicit covariance we will not use the form (|3^). It is of course also possible to 
fully perform the 4-dimensional q-integration and to obtain 

9 2 f 13 j f 13 j m 



Si[x(r)] = - — / dn / dr 2 — K x ( my(r h r 2 ) ) (33) 

OTT z JO JO y(Ti,T 2 J 



where Ki(x) is a modified Bessel function and 



y(ri, r a ) = - x(r 2 ) f . (34) 

For small relative times Eq. (|33D exhibits a stronger divergence ( 1/y 2 ) than in the polaron 
case {1/y) and requires the usual renormalizations of relativistic field theory. As the Bessel 
function is difficult to handle we will not use this explicit form in the following but rather stick 
to the integral representation in Eq. (|30[) . 

From the derivation presented above it should be clear how the particle representation is 
generalized to N nucleons (the case N = 2 has been considered in Ref. []39| neglecting self- 
energy and vertex corrections): to each heavy particle there corresponds just one trajectory. 
This is due to the quenched approximation which neglects production of heavy pairs. Therefore 
the nucleon number is conserved and no splitting of heavy particle trajectories can occur. 



3 Variational Approximation on the Pole of the Two- 
Point Function 

In this Section we only consider the case where no external mesons are present, which cor- 
responds to simply setting the meson sources j(x) to zero. The exact two-point function (or 
propagator) is then given by 

poo f 3 \ rx(/3)=x 

G 2 (x) = const / dd exp --M 2 / Vx{r) exp(-S , \x(t)\ - S 1 \x(t)} ) . (35) 
Jo \ 2 J Jx(o)=o 
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The normalization constant can be determined by switching off the interaction. In this case 

1 



we know 31 



Ga(p) 



5i=0 



d x exp( ip ■ x) G2(x) 



The correct normalization of Eq. 



therefore is 



Go(x) 



In 2 jo 



1 



dp exp 



-Ml 

2 2(3 



Sl =o P 2 + M o 

J Vx exp(— So — Si 
J Vx exp(— So ) 



where the paths are subject to the boundary conditions 

x(0) = , x(/3) 
Similarly, in momentum space we can write 



x 



G 2 (p) 



dp exp 



J d A x exp(ip ■ x) J Vx exp(— S — Si 



(36) 



(37) 



(38) 



(39) 



J d 4 x exp(ip ■ x) J Vx exp(— S ) 
Due to the nonlinear dependence of the action (BOf) on the paths x{r) it is, of course 



impossible to do the path integrals (37, ^) exactly However, following Feynman [11], it is 
possible to find a variational approximation for the effective action starting from a solvable 
trial action. This variational treatment is based on the decomposition 



S = S t + S - S t = S t + AS 



(40) 



and on Jensen's inequality 

(e~ A5 ) > e- <A5> (41) 

which holds for averages with normalized positive weighting functions. If the weighting function 
is not positive (or even complex), or AS is complex, the inequality in Eq. (|4l| ) is replaced by 
a stationarity with respect to variations 



(42) 



Obviously, Minkowski proper time and/or Minkowski space-time only allows the weaker form 
( fl"2f) to be used. In addition to the choice of the trial action St we also have the freedom how 
we define the averaging, i.e. which coordinates we treat exactly and which only approximately 
via the Jensen stationarity. To be more precise, one can define 



< AS> 



JVx{t)AS[x{t)\ exp(-S t [x(r)]) 



or 



< AS > 5i 



JVx(t) exp(-S t [x(r)]) 
/ d A x exp(ip ■ x) J Vx(t) AS[x(t)} exp(—S t [x(r)]) 



(43) 



(44) 



/ d 4 x exp(ip ■ x) J Vx(t) exp(—S t [x(r)]) 

In the first case, which we will call "coordinate averaging" , one has to do the Fourier transform 
with respect to the endpoint x after the averaging to get the approximate two-point function 
in momentum space whereas in the latter ( "momentum averaging" ) only the integral over the 
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proper time (3 still has to be performed. This is reminiscent of the "partial averaging" procedure 
proposed by Doll et al. and employed in the Monte-Carlo calculations of Ref. [[EJ. It is 
clear that coordinate averaging usually is more accurate and that (with euclidean proper time) 
Jensen's inequality (|4lD can be used. On the other hand momentum averaging more directly 
gives the two-point function in momentum space. We will see that with suitable trial actions 
both averaging procedures lead to identical results on the nucleon pole. 



3.1 Coordinate averaging 

Eq. (|37D may be written in the following form 



1 f°° rn 1 ( P*S2 



where the averaging is performed with respect to the weighting function exp(— So) '■ 

-Si \ _ I Vx ex P(- g o) exp(-gi) 
So JVx exp(-S'o) 

= < exp(S t - S) > St y^z . Q r • 46 

JVx exp(-5 ) 



Here S is the sum of So and Si. Applying Jensen's inequality (Eq. (|4T|)), we find 

< e"* > 5o > exp(- < AS > St ) ^"^"f] . (47) 

JVx exp(-5 ) 

The various path integrals may be easily calculated in Fourier space by parameterizing the 
paths as 

^(r)=x- + E — j . (48) 



This obviously fulfills the boundary conditions Q58]). As only the ratio of path integrals appears 
in Eq. ( [46]) the Jacobian from the transformation to Fourier space cancels and the path integrals 
are now infinite-dimensional integrals over the Fourier coefficients bk for k — 1, ... oo. If one 
writes the endpoint coordinate as 

x= ^2(3 b (49) 

then the free action is simply 

oo 

So = E b l ■ (5°) 

fc=0 

The most general trial action with which one can proceed analytically is one where the b^s 
appear at most quadratically. We shall use 

oo 

5*= J2 A * b l . ( 51 ) 

k=0 

with coefficients > parameterized in various forms (see below) or left free as variational 
parameters. A term like bk ■ bo may also be introduced with only minor complications, while 
off-diagonal terms like bk ■ by would require the calculation of infinite-dimensional determinants. 
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By this choice all path integrals are simple gaussian integrals and can easily be performed. 
We obtain 



and 



< Si >i 



< S -S t > St = (1-A )b 2 + 2^ 



- 1 



,2 .0 



2 Jo 



dT\dT2 



d A q 1 
(2vr) 4 g 2 + m 2 



The last average also involves a (shifted) gaussian integral and is given by 



(52) 
(53) 



< exp [iq ■ (x(n) - ar(r 2 )) ] >& • (54) 



< exp [ iq ■ (x(n) - x(r 2 )) } > St = exp (i Tl - - q. x - - /i 2 (ri, r 2 ) g 2 



(55) 



where we have defined 



fe=l 



A. 



and 



Afc(ri,r 2 ) 



V2 



sm 



— sm 



(56) 



(57) 



We shall postpone a discussion of the meaning of the quantity /j 2 , which plays a crucial role 
in what follows, until later. Finally the g-integration in Q53|) can be performed by using the 
representation 



1 



1 



q 2 + m 2 2 



<iw exp 



u 



\ (q 2 + ™ 2 ) 



(58) 



This gives 



< Si > St = 



^7T 2 JO 



da 



0-a/2 
a/2 



dT 



du 



1 



exp 



M 2 

— m 
2 



(7" 



2 / 3 2 u + /x 2 (a,T) 



(59) 



where we have used the symmetry of the integrand to restrict the proper time integrations to 
r 2 < 7~i and introduced relative and total times 



<? = Ti - r 2 , T = -(n + r 2 ) . 



(60) 



The interaction term can be brought into simpler form by the transformation u —>■ fi 2 / (u + fi 2 ) 
which leads to 



< S 1 >, 



^7T 2 JO 



da 



P-a/2 1 

dT 2 , . 

a/2 /X ^(cr, J ) JO 



c?m e m/i(a, T), 



u 



(61) 
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Here the function e(s, t,u) is defined as 



e(s, t, u) = exp 



■S 2 1 — u 
~2 u 



u 



(62) 



In principle, the w-integral can be expressed in terms of a particular plasma dispersion function, 
the so-called Shkarofsky function fl3l , but there is no advantage of using this representation. 

Hence, using Jensen's inequality and the trial action ((5T|), the Green function in coordinate 
space is bounded by 



G*(x) > 



1 roc 1 / R T 2 \ 

— j Q d{3- exp (-|M 2 - - J exp [- Q({3) - < S, > St ] 



where 



k=l 



lnA k + -j- - 1 



(63) 



(64) 



3.2 Renormalization 

Actually as it stands Eq. (|61~D does not exist, since for small relative times (as we shall see 
later) 



a 



(65) 



causing a logarithmic divergence in the cr-integration^. This is, of course, one of the expected 
divergences of field theory which require renormalization. In the present case, renormalization 
is particularly easy, since only a mass renormalization for the heavy particle is needed. In fact, 
the theory is super-renormalizable in the quenched approximation - only the second-order self- 
energy diagram of the nucleon introduces a divergence. We regulate this with a Pauli-Villars 
regularization. This amounts to subtracting a term with the meson mass replaced by a cut-off 
mass A (which will eventually tend to infinity), thus removing the small a-singularity To be 
specific, we subtract 

- e ( Aver, \fau ,u) (66) 

cr v ' 
from < Si >, where /j,q is an arbitrary mass (renormalization point). Since 







- e ( Ay/a, \/afi , 



U 



U 



- e ( Ay/a, y/a~fi , u 



(67) 



is finite at a = and vanishes for A — > oo the averaged action will be independent of //o- We 
will assume a nonzero meson mass m in most of the following and therefore the most convenient 
choice for us is fio — 0. As shown in the Appendix one then obtains 



< Si > 



Si 



< Si > 



fin 



< Si > r 



(68) 



2 In D dimensions the integrand behaves like a D l 2 1 which in D = 3 leads to the integrable singularity 1/y/o 1 
of the polaron problem. 
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where < Si > fin is the finite part resulting from the subtraction ( pop and is given in Eq. ( |A.9| ). 
The regular part reads 



< Sx > 



reg_ 



in 2 Jo 



P rP-a/2 rl 

da / dT / du 

Ja/2 Jo 



M 2 (a,T) 
1 



e m/i(cx, T), 



xa 



u 



a 



e [my/a, 0, u 



(69) 



From Eq. (|68f) and Eq. (p3|) it is evident that the divergent part of the averaged action can be 
absorbed into a new mass parameter 



M{ = Ml 



3_ 

47T 5 



In 



(70) 



which will be found to be finite. After the bare mass has been replaced by M\ all quantities 
are now well defined. Note that the renormalization (fTTj) is in fact the same as in lowest order 
perturbation theory, even though the calculation has been done in a non-perturbative way. 
Note also that M\ is in general not yet the physical mass of the nucleon but an intermediate 
mass scale with no direct physical meaning. Again, the finite shift from M\ to M p h ys will be 
done in a non-perturbative way. 



3.3 On-mass-shell limit 

The physical mass is determined from the requirement that in momentum space the two-point 
function develops a pole when approaching p 2 = —M 2 hjs : 

G2 (P)^ p 2 + Z M 2 • (71) 

Here < Z < 1 is the residue at the pole. How is it possible that 

/Ayr 2 „oo 
d 4 x e ip - x G 2 (x) = / dxx 2 Mpx) G 2 (x) (72) 
p Jo 

diverges at p — zM p h ys ? Obviously this can only be the case if the large- a; behaviour of Gs(x) 
(which is only a function of x 2 ) is not able to overcome the exponential growth of the Bessel 
function glj 

Ji (iM phys x) =ih (M phys x) x ^ i . (73) 

y/27rM phys x 

Therefore the physical mass is given by 

M phys = - lim - \n(G 2 (x)) . (74) 

x — >oo x 

This is similar to the way the ground-state energy is obtained from the partition function in 
non-relativistic physics or the mass of hadrons in lattice calculations. 
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However, the explicit expression for G2(x) fl6"3| ) contains a term exp(— x 2 /2f3) which would 
decay like a gaussian unless the proper time (3 is proportional to x and also tends to infinity. 
These heuristic arguments suggest that we have to study the limit x, (3 — > oo but keep 



A 



1 x 



M phys (3 



(75) 



fixed. In Eq. ( |75|) the extra factor M ph y S has been introduced to obtain a dimensionless 
quantity f\ From Eq. (^3|) we then obtain 



Go(x) > 



1 M, 



phys 



571"" X 



Jo 



where 



F{x,X) 



Ml 



2AM phys ' 2" phys ' AM phys x 



+ 1 n + i(<5 1 >- + <5 1 > 



In the limit a; — > oo Laplace's method [0, |5[] tells us that Eq. (|7^) behaves like 

x-+oo const 



G 2 (x) > 



x 



3/2 



-xF(Ao) 



(76) 



(77) 



(7f 



where F{Xq) is the minimum of F(x — > oo, A) . Inserting this result into Eq. ([T4]) we obtain 

M phys < F(A ) . (79) 

We have to study the large x- and the large /9-limit of the averaged action. First, we note from 
Eq. ( pOl ) that for /i = 



lim - 



< S 1 > 



fin 



0. 



JO) 



Then we assume that 



lim /i 2 (a,T) = fi 2 (a) 



which holds in all parametrizations which we will study. Therefore 



V = lim 4 < Si > reg = 



87T 2 



d(T / <iw 





1 



( AM phys a N 



— e (m\/a", 0, it) 



52) 



J3) 



has a well-defined limit. We will also assume (and later verify) that fl(f3) defined in Eq. 
has a large-/5 limit 

n = lim n(p) . 

0— >oo 

Suppressing the subscript zero for A we finally arrive at the following inequality for the physical 

mass 

Ml A , ,„ 1 



^phys < 



2A 



2 M p 2 hys + -{n + v) 



4) 



^Recall from Eq. (|16|) that our proper time has dimension (mass) 



15 



(a) 



(b) 



Figure 3: Second-order graphs for the two-point function: (a) self-energy graph, (b): tadpole 
graph. In the quenched approximation the tadpole graph is neglected 



Eq. fl84|) is the main result of this Section. Since M p h ys is fixed we can turn it around and use 



Ml > (2A - A 2 ) M 2 h - 2 (O + V) 



55) 



to maximize the r.h.s with respect to A and all the parameters in the trial action. In the 
following we will call Q the "kinetic" term because it has no explicit coupling constant depen- 
dence and V the "potential" term because it has. In addition, Eq. fl34|) looks like a variational 
equation for the energy in nonrelativistic quantum mechanics. 

Without variation the equality sign in Eq. ( p5| ) gives the perturbative result from the one- 
loop graph shown in Fig. 3 (a). This can be seen as follows: while we expect A = 1 + 0(g 2 ) 
the combination 2A — A 2 is 1 + 0(g 4 ). Similarly, from — 1 + 0(g 2 ), we deduce Q = 0(g A ) 
(see Eq. (|4]) ) and /z 2 (cr) = o + 0(g 2 ). Therefore to lowest order in g 2 we obtain 



Ml 



M 2 phys - 2 V 



A=l 



/j, 2 (a)=a 



+ 0(g 4 



or 



M 2 hys = Ml + " 



47T Z 



du In 



1 + 



"^phys W 

1 - U 



36) 



(87) 



after performing the a-integral. The same result is obtained from the direct calculation of the 
self-energy diagram in Fig. 3 (a) 



S(p 2 



9 2 , A 2 
In — 



4tt 5 



+ 



9 

Air 2 



du In 



p 2 Ml u 



The pole position is determined by M 2 hys 



M 2 + £( 



-M 2 ^ 

phys / 



in lowest order after renormalizing the mass (see Eq. (|70|)). 



from which we obtain Eq. ( {g7D 



3.4 Momentum averaging 

In coordinate averaging the determination of the physical mass was a rather involved procedure. 
This is avoided in "momentum averaging" , where we also average over the endpoint coordinate 
x with the additional weight exp(ip ■ x). This extra weight can be formally absorbed in a 
modified (complex) free action 

Sq = Sq — ip ■ x . (89) 
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In other words, we write Eq. (|39| ) as 



where 



2 Jo 



-% 2 + MZ) 



< e 5l > 



So 



(90) 



/ £>x exp (—So) exp(— Si 
/ exp(— So) 

<exp(Si-S) >g t 



/ exp(— S t 
/X>x exp(-S ' 



Here we have defined 



Vx 



d x j T>x ... 



(91) 



(92) 



Because the weight function is now complex, we can only apply Jensen's stationarity relation 



< e Sl ~ exp ( - < S - S t > 5 



JVx exp(-S t ) 
JVx exp(-S ) 



As trial action we take 



k=0 



where A is an additional variational parameter which rescales the momentum. 

As the evaluation of the various path integrals closely follows the one in Section |3TT 
be brief and just state the results 



/ Vx exp (—St 
JVx exp(-S 



« S -St »5= 2£( — -1) - ^p 2 ^(\ + \A -2A ) 



k=0 



and this time the interaction term is 

.2 r p rP-a/ 2 



« »S t 

Here 



9 

87T 2 Jr, 



da / dT ~2( r \ I 

J a/2 /J '.((J, 1 J JO 



du e (mjl(a, T), 



A /2(<7, T) 



jl 2 (a,T) 



a 



A P 



+ ^(v.T). 



(93) 
(94) 
we can 

(95) 
(96) 

(97) 
(98) 



Renormalization of the averaged action is along the same lines as in the Appendix. Combining 
all terms we obtain the propagator in momentum space 



gm - ^r^ exp H (p2+Mi2)+ 2 p2(i "^ )2 ) 

• exp (- tt(/3) - < Si > rcg - < Si > f 



fin 



(99) 
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where 



9 oo 

m = -a E 

V fc=0 



lnA fc + — - 1 
Ah 



(100) 



and 



< S 1 > reg 



?7T 2 JO 



da 



(9-0-/2 
a/2 



dT I du 



o 



0V, r) 

— e (my/a, 



e m/2(er, T), 



-iXpa 



A fi(a, T) 



u 



u 



(101) 



Because the small cr-behaviour of jl 2 (a,T) is the same as that of /z 2 (cx, T) (see Eq. (|98|) ) we 
have subtracted the same term as before. This explains why the finite part < Si > fin of 
the averaged action is unchanged. 

The on-shell limit of Eq. (p9|) is now particularly easy : a pole develops if in 



1 r 00 

Gz(p) - o / d @ ex P 

2 Jo 



(102) 



the function F 



oo,p 



M 2 phys = Ml + M p 2 hys 1 



vanishes. This leads to 



+ 2 lim 



£l(P) + - « S 1 » rcg 



p=iM phys _ 



(103) 



For any sensible parametrization Aq is finite in the large /5-limit. Therefore the tilde can be 
dropped from ft 2 (u) and Q for large (3 (see Eqs. ( p8| , |100| ) ) and Eq. ( p, 03| ) is completely 
equivalent to Eq. (|84j) if we identify 

A = A A . (104) 

Due to the use of a complex trial action momentum averaging only tells us that the r.h.s. 
of Eq. ( |103| ) is an extremum (and not necessarily a minimum) under variations. Since the 
intermediate mass scale M\ does not show up in any observables this has no direct physical 
consequences. Of course, a minimum principle has the extra advantage that the minimal value 
gives a clear measure of the quality of the variational ansatz. 



4 Variational Ansatze 

Having developed the general formalism for the variational calculation in the last two sections 



we now need to turn our attention to the specific form of the trial action (|5lD . We shall first 
consider two specific parametrizations of the Fourier coefficients A k of this action, followed by 
the best possible parameterization (within the gaussian ansatz) where the actual functional 
form of the A^s is determined by the variational principle. Before we do this, however, it 
is useful to discuss some general features of the trial action. We begin by writing down the 
general quadratic two-time action in coordinate space 



S t [x] = dr -x + dn I dT 2 f(T 1 -r 2 ) [x(n) - x(r 2 )\ (105) 
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where f(r\ — T-i) is an undetermined retardation function. Inserting the Fourier parametrization 
( fi8| ) of the paths we obtain the following expressions for the Fourier coefficients Ak 

AM = 1 + 2^daf(a)a 2 - ^ (106) 

Here we have neglected cross terms of the form bk ■ by , k, k' = 0, 1 ... which are suppressed 
for large (3 [0. It is therefore consistent to also take the large /3-limit of Eqs. ( [L06| , |107[ ). This 
gives 

A M = 1 + ^ / o At /(a) sin 2 , fc = 0, 1 ... (108) 

In the following we will use only this form. Note that in this expression the dependence on (5 
and the number k of the Fourier mode only comes in via the combination 

E = — . (109) 

Writing Ak = A(k7c/j3), in particular Ao = ^4(0) , we therefore have 

8 r°° Kit 
A(E)= 1 + — jf daf{a) sin 2 — . (110) 

Clearly A(E) is even : 

A(-E) = A(E) (111) 

and tends to unity for large E 

A(E) E -^ 1. (112) 

The way how this limit is approached depends on the small- a behaviour of the retardation 
function f(cr). We should emphasize that the trial action which we use is given by 

St = jt A ( k 4) b\ (113) 



k=0 



in Fourier space and not by Eq. ( |105| ) in x-space. However, since one usually has more intuition 
in coordinate space it is useful to deduce general properties and special parametrizations for 
the "profile function" A(E) from the rc-space formulation. 

We are now in a position to express the quantities /i 2 (cr) and Q in terms of A(E). The 
tool to perform the sums over Fourier modes in Eqs. (|56| , [S3l ) is Poisson's summation formula 

+ 0O +00 n+OO 

F(k) = E / dx F(x) e 2i7Tnx (114) 

fc=-oo n=-oo 

which, for an even function F(k7i//3), leads to 

£ F 7T =-/ dEF(E)--F(0) + ^Y, dE F(E) cos(2n/3£) . (115) 

k=l V P ) 71 J ° 2 71 n=l J ° 
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This is an exact form which is much more useful for our purposes than, for example, the Euler- 
MacLaurin summation formula [[11] . The usefulness of Eq. (|115|) comes from the fact that for 
ordinary functions the asymptotic behaviour of the Fourier cosine transformation [IB is given 
by 

F'(0) F"(0) 



dx F(x) cos (2xy) 



(116) 



(2y) 2 {2yY 

Since A(E) is even all odd derivatives at E = will vanish, unless F(x) is singular at x = 0. 
Therefore the asymptotic fall-off of the last term in Eq. ( |115| ) with increasing (3 will not be 
powerlike but in most cases at least exponential. For brevity such terms will be denoted by 



23 ^ f°° 

Ex,- ((3) = — V / dE F(E) cos(2n(3E) 
* n=l J o 



(117) 



where i is an index with which we label the various functions F which occur. Let us first apply 
Poisson's summation formula ( 115 ) to the sum in Eq. fl56|) . Recalling the definitions fl57|) and 
flBD]) we obtain 



/i 2 (a,T) = 8f3j2 



7T JO 



k=l 

oo 



Au k 2 7T 2 



sm 



■ COS 



kirT 



dE 



1 1 - 2 Ea 2 

Sm "2~ C ° S ET 



PA(0) 



+ E Xl {(3) 



:ii8i 



The trigonometric identity cos 2 ET = (1 + cos2ET)/2 allows us to simplify Eq. (|118 ) further: 
again the cosine term only contributes to exponentially small terms so that 



/, 2 (a,T) 



4 /*oc 



7T JO 



dE 



1 sin 2 (Ea/2) a 2 



A(JB) 



E 2 



PA{0) 



Ex 2 (/3) 



(119) 



In this form the limit (3 — > oo is trivial and given by the simple formula 



/i (a) = lim (i (a,T) = - 

/3^oo 7T JO 



sin 2 (E<r/2) 



A(E) E 2 

We further note that because of Eq. ( |112| ) the small a-limit of n 2 is 



(120) 



lim /i 2 (a) 

(7 — >0 



4 

7T JO 



- sin 2 (ffa/2) 
dfi ^ = * 



:i2i^ 



which is what we have used for discussion of the divergences in the averaged action (see Eq. 
(|S5D). The large-a limit is given by 



lim /i 2 (cx) 



4 1 



ttA(0) Jo 



- sin^o/2) 
dE E 2 



a 



A(0) 



(122) 



4 Strictly speaking these terms are exponentially small in T . not j3 . In order to obtain sensible asymptotic 
behaviour for the theory, however, it is necessary for the trial action (|l05| ) to receive its main contribution for 
Ti.2 not too close to the endpoints of the path. Hence T = [t\ + 72)/2 must grow like f3 . 
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Because both the small and large a limit of /i 2 (cr) are proportional to a we shall call it a 
"pseudotime" . 

We now turn to the sum over Fourier modes in Eq. flB3|) . By applying Eq. ( J115 ) one easily 
obtains 



2 r 00 
tt([3) = - dE 

IX Jo 



In A(E) + 



1 



A(E) 



- 1 



1 



lnA(O) + 



1 



A(0) 



- 1 



so that 



Q, = lim 

/3— >oo 



7T JO 



lnA(E) 



A(E) 



+ Ex 3 (/3) (123) 



(124) 



For convergence of the integral A(E) has to approach unity faster than 1/y/E for large E. 



4.1 Feynman parametrization 

In his famous polaron paper, Feynman [II]] chose the retardation function 

f(a) = f F (a) = 



(125) 



with C and w as variational parameters. This was motivated by the exact polaron effective 
action (|31~D, which has an exponential retardation function due to the time it takes for phonons 
to be emitted and reabsorbed by the electron. Furthermore, it may be argued |20| that the 
exponential suppression at large relative times suppresses, at least partially, the increase of the 
quadratic trial action (|105| ) for large x(t\) — xfa). (The exact action obviously goes to zero 
in this limit.) For this reason we will still adopt Eq. (|125|) for the variational approximation 
to the meson- nucleon action (Eq. fl3"0|)) in a first try in this subsection, even though now, of 
course, there is no explicit retardation function in proper time in this action. We will see that 
this allows many calculations to be done analytically. In the next subsections we will consider 
more general trial actions. 

Again following Feynman, we replace the strength C > by a parameter v via 



w 



AC 
+ — . 

w 



It is obvious that v has to be larger than w. From Eq. ( |110| ) we obtain 

v 2 + E 2 



A F (E) 



w 2 + E 2 



(126) 



(127) 



Note that as a function of the complex variable E Feynman's profile function vanishes at 
E = ±iv which in Minkowski space determines the location of the caustics (or focal points) PU 
In addition Ap(E) has poles at E — ±iw . From Eq. (|120| ), we obtain the pseudotime 



w 



v 2 — w 2 



(1- 



:i28) 



V v° 

The limits ( |121| ) and (|122|) can be read off directly from this explicit form. Finally one obtains 

n F = {v - w? (129) 

V 

which is the D = 4 generalization of the polaron result Q. 

5 In the polaron case the kinetic term in the variational expression for the energy is 3(v — w) 2 / iv Jll]] . 
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4.2 An improved retardation function 

The Feynman parametrization outlined above has the advantage that it is extremely simple and 
that many manipulations may be done analytically. It has the disadvantage that for small a it 
exhibits a different behaviour to the true action, which is singular at this point. We shall now 
indicate heuristically how one may arrive at a trial action which does exhibit this singularity 
behaviour. To start off with, we shall add a constant term to the previous action (|105|) : 



d,Ti 



dro 



gin - r 2 ) + /(n - r 2 ) (x(n) - x{t 2 )Y 



(130) 



This should mimic the exact action (|30|) as much as possible. Here we have written the 
constant term (which cancels in the averaging procedure) as a double-time integral over a 
function g(ri — r 2 ). We can determine the functions / and g approximately by requiring that 
on the level of the proper time integrands the momentum averaging of ( |130| ) should be equal 
to the momentum averaging of the exact action. To avoid nonlinear equations we perform 
the averaging with the free action. Using Eqs. ( P7| ) and (58) in the large-/? limit and setting 



A = An 



7~i — r 2 = cr we obtain 



g 2 1 

giv) + f(?) < ( x ( T i) - x(t 2 )) 2 >,§ ~ -— - du e (my/a, -ip y/a, u) . (131) 

07T a Jo x ' 

If we approximate the u-integral by taking the integrand at some u = u we obtain 



g(a)+f(a) « (x(n) - x(r 2 )) 2 » 



So 



8tt 2 (T 



exp 



1 / 2 1 U 2-\ 

— (m p u) a 



Furthermore, as a special case of the general averaging ([55]) we have 



(132) 



< (rc(n) - x(r 2 )) z >g = 4a - <7 2 p 2 



(133) 



which is well known in Brownian motion : at small times the mean square distance in a 
diffusion process grows linearly with the time. Expanding around p 2 - 
coefficients we finally obtain for the retardation function f(a) 



Mp hys and comparing 



32tt 2 a 2 

9 

a 1 



cxp 



LI 



m 



u 



+ M 2 phyB u 



a 



(134) 



The most remarkable feature of the 'improved' retardation function ( 134 ) is that it is singular at 
small relative times and thereby simulates the singular behaviour of the exact effective action. 
Although Eq. ( |134| ) gives explicit values for the constants C and w these should not be taken 
too seriously as they are derived from averaging with the free action. We will only use the form 
of the retardation function as suggested by Eq. (|134p and again treat C and w as variational 
parameters. The resulting profile function is 



ME) = 1 + 



4C" 



arctan ■ 



E 



w 



w 
2E 



In 1 



E 2 



w- 



(135) 
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At large E this falls off only like 

ME) 1 + ^ + (136) 

which reflects the small cr-behaviour of the retardation function. Furthermore, Aj(E) now has 
a branch point at E = ±iw which will become important when we study processes like meson 
production and scattering in subsequent applications. Again, we can eliminate the strength 
parameter C in terms of a parameter v by writing Aj(Q) = v 2 /w 2 . This determines 

C' = ^(v 2 -w 2 ). (137) 

We have been unable to find analytical expressions for /x 2 (cr) and Q with the profile function 
( |135|) . They will be calculated numerically in all the applications which follow. 



4.3 Variational equations 

The optimal choice for the retardation function is obtained if one doesn't restrict its func- 
tional form in the way we have done in the two cases above, but rather determines this form 
through the variational principle itself. In the polaron case this approach was first proposed 
by Adamowski et al. |13| and Saitoh [14|. It corresponds to varying Eq. (|85|) with respect to 
A and the profile function A(E). We first recall from Eq. ( |120| ) that the pseudotime fi 2 (a) can 
be expressed through the profile function by 



„ 2 (a 



4 

7T JO 



dE 



sin 2 (£a/2) 



We may then vary Eq. 



A(E) E 2 
with respect to A. This gives 



;i38) 



2(1 



dV 



0. 



The derivative can be worked out easily (see Eqs. 
for A 



I g 2 f°° a 2 r 1 I 

H t / da / duue \ maia 

87?^ Jo ur cr Jo \ 



2|. pz[) ) and we obtain the implicit equation 
AM phys (T 



A JO JO 

Similarly, the variation with respect to A(E) 

5 



H(<r) 



u 



(139) 



5A(E) 



(fi + V 



gives 



A{E) 



1 + 



9^l_ 

4tt 2 E 2 



da 



sin 2 ( J Bcr/2) 



du 



m 



1 + irVW 



u 



u 



A 2 M 2 hys cr- 



u 



( AMphygCT 

e \ mfi(a), — — , u 



(140) 



23 



where Eq. ( |138[ ) has been used to evaluate 8fi 2 (a)/SA(E). 

Let us discuss some of the aspects of the coupled variational equations (|138[) - (|140|) . We 
first note that we may read off the retardation function, as defined in Eq. ( |11(J| ), from the profile 
function ( |140|) ; it is given by 



32tt 2 /x 4 (a) 



du 



m 2 9 , . 1 — u 
2 u 



A 2 M p 2 hys a 2 



m/i(a) '^T' u 



(141) 

Obviously it has the same l/a 2 -behaviour for small relative times as the 'improved' parametriza- 
tion (|134|) . Furthermore, it should be noted that no renormalization is needed : all integrals 
converge for o — > 0. In addition, the variational equations are also well behaved in the limit 
m — > 0. From Eq. (|139|) we observe that 



< A < 1 



(142) 



always, which allows interpretation of A as a kind of average "velocity" (see Eq. ([75])) in the 
proper time. From Eq. ( 140|) we find that asymptotically 



A VW {E) ^ 1 + 



4tt 2 E 2 jo 

g 2 i 
l + — h 

16irE 



°° , sm 2 (Ea/2) 

da ' + 

a 1 



(143) 



which is consistent with Eq. ( |136| ). Note that while V needs renormalization, Q does not 
because the .E-integral in Eq. QSB1 ) is still convergent with the asymptotic behaviour ( |143| ). 



4.4 Approximate solution of the variational equations 



Although we will present numerical solutions of the above variational equations in the following 
paper, it is very useful to first attempt to derive some approximate analytical results. Because 
the ratio of the pion mass compared to the nucleon mass is small ( m 2 /M 2 hys ~ 0.02), a 
natural approximation to make is to set the pion mass to zero. This is a meaningful thing 
to do because, as we have already noted, the variational equations (|138j) - ( |140j ) are both 
ultraviolet- and infrared- safe. For m — 0, the equation for A becomes 



1 g 2 1 

A ~ 1 + 2^M p \ ys A 4 ,/n 



oo I 

da — 

0~ 



(l+7(<r))e^ (CT) 



and the corresponding equation for the profile function is 



(144) 



A(E) = 1 + 



4tt 2 E 2 



where 



7(a) 



A 2 M p 2 hys a 2 



(145) 



(146) 



Furthermore, as seen in Eqs. ( |121| ) and ( |122| ), the pseudotime /! 2 (a) is proportional to a both 
in the small- and large-a limit. Let us for the moment assume, in order to be able to do the 
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remaining integrals in Eqs. ( |144| ) and ( |145| ) , that the pseudotime is in fact always proportional 
to the relative proper time 

^ ra (147) 



with r < 1. This approximation will be a good one if either the region of small or large a 
dominates the integrals. One can now evaluate all the integrals. Defining the dimensionless 
coupling constant 



a 



El 1 

4vr M p 2 hys 



:i48) 



the variational equation for A ( Eq. |144| ) becomes 



1 

A 



1 + 



a 



nrX 2 



while the variational equation for the profile function yields 



A(E): 

In particular 
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2Enr 2 



arctan 
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AEr 



In 1 + 



2rE , 
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(150) 



(151) 



This is precisely the form of the profile function ( |135| ) obtained with the 'improved' retardation 
function in Section JO, if we identify 



C _ « M phys 



and 



w 



A 2 M p 2 hys 
2r 



(152) 



Solving Eq. (|149|) for A, one obtains 

A p 



1 ± Wl - 



4a 



7rr 



(153) 



This equation has some rather remarkable properties. First of all, it has no real solutions for 
a larger than 

(154) 



7T 



a. 



Below this branchpoint it has two solutions, one approaching A = 1 as the coupling a goes 
to zero, while the other one approaches A = 0. The first of these limits corresponds to the 
perturbative limit (see Section [3~3D , while A = seems unphysical (see Eqs. (0) or (|35l)). 
If one argues that mostly small a-values matter in the respective integrals, i.e. r m I then 



a c w % = 0.785 
4 



(155) 

For a > a c only complex solutions are possible. This is a sign of the instability of the model 
and will be studied in more detail in the following paper. 
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5 Discussion and Summary 



In this work we have introduced a variational approach to relativistic quantum field theory 
which is closely modelled on the very successful treatment of the polaron in condensed matter 
physics. The final aim is to do this for a realistic theory such as QED or a meson-nucleon 
theory. However, there are considerable problems in going from the non-relativistic polaron 
problem to a field theory. So, in order not to be confronted with all complications at once, 
we have chosen to start with a toy theory (the Wick-Cutkosky model) which is not a gauge 
theory and where spin and isospin degrees of freedom are neglected, but where the coupling is 
of a similar Yukawa form as for the more physically relevant theories mentioned above. This 
theory not only has the advantage of relative simplicity, but it also turned out that the action 
is actually extremely similar to the polaron action so that one might expect to have similar 
success by using the same variational treatment as was introduced by Feynman in the polaron 
problem. 

Following this idea we have integrated out the light mesons and represented the heavy 
particles degrees of freedom by trajectories parametrized by the proper time. This step nec- 
essarily required neglect of heavy particle pair production, i.e. the quenched approximation. 
The resulting non-local effective action S e f[ was then approximated variationally by a retarded 
quadratic action S t whose parameters (the "profile function" A(E) and an average "velocity" A) 
have to be determined on the pole of the two-point function. Apart from technical differences 
the Wick-Cutkosky model here again turned out to be very similar to the polaron problem. We 
have introduced two different ways of averaging over the exact action ("coordinate averaging 
and "momentum averaging" ) which gave identical results on the pole of the two-point function. 
In contrast to methods which optimize perturbation theory []48| , [49fl ours is a truly variational 
approach and, as shown in the case of "coordinate averaging" , even a minimum principle. 

However, the model to which we applied our method clearly also has some disadvantages. 
One of the technical differences to the polaron problem is the need of renormalization in a 
relativistic field theory. In this respect the Wick-Cutkosky model is too simplistic: only a mass 
renormalization is needed in the quenched approximation (i.e. the model is superrenormal- 
izable) which certainly is not enough for dealing with the (non-perturbative) renormalization 
of realistic theories. Of more immediate concern, however, is the fact that, unrelated to the 
variational approach as such, the model is unstable. This is of course not a feature of the 
more realistic problems which one is interested in the first place. Luckily, we have been able 
to ignore this instability in so far as that, at least in the variational approach presented here, 
it only starts to manifest itself for couplings larger than some critical coupling. 

Nevertheless, the instability prevents us from comparing the results of the variational cal- 
culation to a strong coupling limit of the theory. This is rather unfortunate, as for the polaron 
the success of the approach could be gauged by the excellent agreement of the variational treat- 
ment with both the strong and weak coupling limits. Here we can only compare with the latter, 
a comparison with the strong coupling limit will have to wait until the method is applied to a 
theory where this limit exists in the first place. Actually, although a stable model would have 
been more welcome, the instability does allow us, through the use of this non-perturbative 
method, to explore the behaviour of the theory around the critical coupling, something which 
one could not do in perturbation theory. 

As was the case for the polaron, the variational calculation contains within it first order 
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perturbation theory, as we have seen by way of example for the self-energy of the heavy particle. 
Importantly this is true for any value of the variational parameters so that agreement with 
the first order perturbative calculation is assured. In the language of perturbation theory, 
variation of the parameters then allows one to effectively sum up parts of higher diagrams up 
to all orders. In principle, the variational approach may be improved systematically by going 
beyond the leading order of the cumulant expansion which is used in Feynman's variational 
principle. In the polaron case this leads to results for the ground state energy and the effective 



mass [M which nearly match the exact Monte-Carlo calculations [12]. In practise, however, 



higher order corrections become increasingly difficult to calculate and so the usefulness of the 
approach depends on how closely the leading orders reflect reality. In particular, the accuracy 
of the zeroth order results (using the first order variational parameters) is of interest. We 
will show in a subsequent paper that already the zeroth order approximation gives a quite 
reasonable description of meson production and scattering processes after analytic continuation 
to Minkowski space. 

An important ingredient of the approach advocated here is to apply the variational principle 
to the action expressed in terms of particle coordinates rather than fields, as has previously 
been done. The reason for doing this is the reduction in the number of degrees of freedom 
which this entails. This is important as one is restricted to generalized quadratic trial actions 
for practical variational calculations. Furthermore, as we have seen, the particle action makes 
extraction of the connected part of a Green function completely trivial. On the other hand, 
one might consider it to be a disadvantage that the action in the particle representation is 
non-local. Although not crucial, there is a certain loss of intuition associated with this. For 
example, in the formulation in terms of fields one may extend the concept of the classical 
potential, and the physical picture which this entails, to higher orders in the coupling through 
the use of the effective potential. Even at the classical level, it is immediately clear by looking 
at the potential in Fig. [j] that the Wick-Cutkosky model is unstable. It is rather difficult to 
see this in the particle representation of the action fl30D . Indeed, even after approximating the 
particle action by the trial action one first had to solve a set of nonlinear coupled equations 
before any signs of the instability manifested itself. Fortunately, we could obtain very good 
approximative results and analytical insight for the solution of the variational equations by 
setting the meson mass to zero and by replacing the "pseudotime" /x 2 (cr) by its limit when the 
relative proper time a tends to zero. The success of this rather drastic approximation indicates 
that to a large extent the dynamical behaviour of this relativistic system is governed by short- 
time processes. Although no substitute for a numerical solution, these analytical expressions 
prove to be rather useful guides to the general behaviour of the solutions. Whether the value 
of the critical coupling is only an artefact of our present quadratic approximation or has some 
physical meaning is not fully clear. In support of the latter view it may argued that the critical 
coupling corresponds to the situation where the average heavy particle field is just large enough 
to overcome the barrier depicted in Fig. |] . 

In conclusion, we think that the variational approach in the form advocated here looks 
rather promising at least for the particular model which we have examined. Not only has 
it provided rather simple analytical expressions which go considerably beyond perturbation 
theory, but it also allows for numerical investigations which will be reported in the following 
paper. We therefore believe that it is certainly worthwhile to apply and extend it to other 
more realistic cases. 
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Appendix : Regularization 

Here we perform the regularization of the averaged action < S\ >s t by subtracting the term 
(Bp) from the cr-integrand. Allowing for an arbitrary subtraction point /xq we then have 



< Si > St 
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We write the quantity in square brackets as 
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and concentrate on the term in the first line which diverges if the cut-off A goes to infinity. 
The term in the second line gives rise to the regular part (|69| ) of the averaged action . For the 
first term we can perform the T-integration immediately since the integrand does not depend 
on T . This gives a factor j3 — a. With the explicit form fl62|) of the function e(s, t, u) we then 
have to evaluate 
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The a - integral can be done in terms of the exponential integral |4T 

/CO 1 
dt - e~ zt . 

For z — > this function behaves like 

E x (z) — > - 7 - In z - O(z) 
where 7 = 0.577215... is Euler's number and for z — > 00 like 
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We easily find 

< Sx > div 



In 2 Jo 



du I (3 
1 - e~ 



ln ^M + El (z Ai(i0 (u)(3) - E^z^u)? 



+ 



*A lW ,(u) 



X _ e - z A, M0 («)/3 



(A. 



29 



In the limit where the cut-off mass A goes to infinity this becomes simpler due to Eq. ( |A.7| ) 
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The above expression for the finite part simplify considerably for /io = and/or (3 — > oo. This 
is what we employ in the main text. Note that for m = we would need /io ^ . 
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